Login



Other Articles by Author(s)

Amjad Jumaah Frhan



Author(s) and WSEAS

Amjad Jumaah Frhan


WSEAS Transactions on Signal Processing


Print ISSN: 1790-5052
E-ISSN: 2224-3488

Volume 13, 2017

Notice: As of 2014 and for the forthcoming years, the publication frequency/periodicity of WSEAS Journals is adapted to the 'continuously updated' model. What this means is that instead of being separated into issues, new papers will be added on a continuous basis, allowing a more regular flow and shorter publication times. The papers will appear in reverse order, therefore the most recent one will be on top.



Hierarchical Agglomerative Clustering Algorithm Based Real-Time Event Detection from Online Social Media Network

AUTHORS: Amjad Jumaah Frhan

Download as PDF

ABSTRACT: Event detection from online social networks based on the user behaviour has been a research area which has garnered immense attention in the recent years. Many works have been developed for event detection in multiple social media sources like Twitter, Facebook, YouTube, etc. The user updates including short texts, photos and videos can be utilized in detecting the events. However detecting the number of common events from the social media content requires efficient distinguishing as the size of the content and number of users is large, leading to large data. In this paper, a new approach is proposed named as Event WebClickviz that performs the dual functions of visualization and behavioural analysis based on which the events are detected. In this approach, the event detection problem is modelled as clustering problem. Named Entity recognition with Topical PageRank is employed for extracting the key terms in the texts while the temporal sequences of real values are estimated to build the event sequences. The features are extracted by applying the concept of sentiment analysis using term frequency–inverse document frequency (TF-IDF). Based on these features the content is clustered using Hierarchical Agglomerative clustering algorithm. Thus the event is detected with high efficiency and they are visualized better using the proposed model. The simulation results justify the performance of the proposed Event WebClickviz.

KEYWORDS: Event detection, visualization, Named Entity recognition, Topical PageRank, Hierarchical Agglomerative clustering, term frequency–inverse document frequency.

REFERENCES:

[1] Go, A., Huang, L., & Bhayani, R. (2009). Twitter sentiment analysis. Entropy, 17, 252.

[2] Bifet, A., & Frank, E. (2010). Sentiment knowledge discovery in twitter streaming data. In International conference on discovery science (pp. 1-15). Springer, Berlin, Heidelberg.

[3] Kouloumpis, E., Wilson, T., & Moore, J. D. (2011). Twitter sentiment analysis: The good the bad and the omg!. Icwsm, 11(538-541), 164.

[4] Tumasjan, A., Sprenger, T. O., Sandner, P. G., & Welpe, I. M. (2010). Predicting elections with twitter: What 140 characters reveal about political sentiment. ICWSM, 10(1), 178-185.

[5] Mathioudakis, M., & Koudas, N. (2010). Twittermonitor: trend detection over the twitter stream. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of data(pp. 1155-1158). ACM.

[6] Cha, M., Haddadi, H., Benevenuto, F., & Gummadi, P. K. (2010). Measuring user influence in twitter: The million follower fallacy. Icwsm, 10(10-17), 30.

[7] Ghosh, S., Sharma, N., Benevenuto, F., Ganguly, N., & Gummadi, K. (2012). Cognos: crowdsourcing search for topic experts in microblogs. In Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval (pp. 575-590). ACM.

[8] Chen, K., Chen, T., Zheng, G., Jin, O., Yao, E., & Yu, Y. (2012). Collaborative personalized tweet recommendation. In Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval (pp. 661-670). ACM.

[9] Galuba, W., Aberer, K., Chakraborty, D., Despotovic, Z., & Kellerer, W. (2010). Outtweeting the twitterers-predicting information cascades in microblogs. WOSN, 10, 3-11.

[10] Kwak, H., Lee, C., Park, H., & Moon, S. (2010). What is Twitter, a social network or a news media?. In Proceedings of the 19th international conference on World wide web (pp. 591-600). ACM.

[11] Signorini, A., Segre, A. M., & Polgreen, P. M. (2011). The use of Twitter to track levels of disease activity and public concern in the US during the influenza A H1N1 pandemic. PloS one, 6(5), e19467.

[12] Dong, A., Zhang, R., Kolari, P., Bai, J., Diaz, F., Chang, Y., & Zha, H. (2010). Time is of the essence: improving recency ranking using twitter data. In Proceedings of the 19th international conference on World Wide Web (pp. 331-340). ACM.

[13] Starbird, K., & Palen, L. (2012). (How) will the revolution be retweeted?: information diffusion and the 2011 Egyptian uprising. In Proceedings of the acm 2012 conference on computer supported cooperative work (pp. 7-16). ACM.

[14] Alkazemi, M. F., Bowe, B. J., & Blom, R. (2012). Facilitating the egyptian uprising: A case study of facebook and. Cases on Web, 2, 256.

[15] Mills, A., Chen, R., Lee, J., & Raghav Rao, H. (2009). Web 2.0 emergency applications: How useful can Twitter be for emergency response?. Journal of Information Privacy and Security, 5(3), 3-26.

[16] Becker, H., Naaman, M., & Gravano, L. (2011). Beyond Trending Topics: Real-World Event Identification on Twitter. ICWSM, 11(2011), 438-441.

[17] Petrović, S., Osborne, M., & Lavrenko, V. (2010). Streaming first story detection with application to twitter. In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (pp. 181-189). Association for Computational Linguistics.

[18] Valkanas, G., & Gunopulos, D. (2013). How the live web feels about events. In Proceedings of the 22nd ACM international conference on Information & Knowledge Management (pp. 639- 648). ACM.

[19] Abdelhaq, H., Sengstock, C., & Gertz, M. (2013). Eventweet: Online localized event detection from twitter. Proceedings of the VLDB Endowment, 6(12), 1326-1329.

[20] Frhan, A. J. (2017). Website Clickstream Data Visualization Using Improved Markov Chain Modelling In Apache Flume. In MATEC Web of Conferences (Vol. 125, p. 04025). EDP Sciences.

[21] Al-Ariki, H. D. E., & Swamy, M. S. (2017). A survey and analysis of multipath routing protocols in wireless multimedia sensor networks. Wireless Networks, 23(6), 1823-1835.

[22] He, Q., Chang, K., & Lim, E. P. (2007). Analyzing feature trajectories for event detection. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval (pp. 207- 214). ACM.

[23] Weng, J., & Lee, B. S. (2011). Event detection in twitter. International Conference on Weblogs and Social Media 11, 401-408.

[24] Li, R., Lei, K. H., Khadiwala, R., & Chang, K. C. C. (2012). Tedas: A twitter-based event detection and analysis system. In Data engineering (icde), 2012 IEEE 28th international conference on (pp. 1273-1276). IEEE.

[25] Ciglan, M., & Nørvåg, K. (2010). WikiPop: personalized event detection system based on Wikipedia page view statistics. In Proceedings of the 19th ACM international conference on Information and knowledge management (pp. 1931-1932). ACM.

[26] Ahn, B. G., Van Durme, B., & CallisonBurch, C. (2011). WikiTopics: What is popular on Wikipedia and why. In Proceedings of the Workshop on Automatic Summarization for Different Genres, Media, and Languages (pp. 33- 40). Association for Computational Linguistics.

[27] Liu, Z., Huang, W., Zheng, Y., & Sun, M. (2010). Automatic keyphrase extraction via topic decomposition. In Proceedings of the 2010 conference on empirical methods in natural language processing (pp. 366-376). Association for Computational Linguistics.

[28] Chowdhury, G. G. (2010). Introduction to modern information retrieval. Facet publishing.

WSEAS Transactions on Signal Processing, ISSN / E-ISSN: 1790-5052 / 2224-3488, Volume 13, 2017, Art. #24, pp. 215-222


Copyright © 2017 Author(s) retain the copyright of this article. This article is published under the terms of the Creative Commons Attribution License 4.0

Bulletin Board

Currently:

The editorial board is accepting papers.


WSEAS Main Site